jMOTU and Taxonerator: Turning DNA Barcode Sequences into Annotated Operational Taxonomic Units
نویسندگان
چکیده
BACKGROUND DNA barcoding and other DNA sequence-based techniques for investigating and estimating biodiversity require explicit methods for associating individual sequences with taxa, as it is at the taxon level that biodiversity is assessed. For many projects, the bioinformatic analyses required pose problems for laboratories whose prime expertise is not in bioinformatics. User-friendly tools are required for both clustering sequences into molecular operational taxonomic units (MOTU) and for associating these MOTU with known organismal taxonomies. RESULTS Here we present jMOTU, a Java program for the analysis of DNA barcode datasets that uses an explicit, determinate algorithm to define MOTU. We demonstrate its usefulness for both individual specimen-based Sanger sequencing surveys and bulk-environment metagenetic surveys using long-read next-generation sequencing data. jMOTU is driven through a graphical user interface, and can analyse tens of thousands of sequences in a short time on a desktop computer. A companion program, Taxonerator, that adds traditional taxonomic annotation to MOTU, is also presented. Clustering and taxonomic annotation data are stored in a relational database, and are thus amenable to subsequent data mining and web presentation. CONCLUSIONS jMOTU efficiently and robustly identifies the molecular taxa present in survey datasets, and Taxonerator decorates the MOTU with putative identifications. jMOTU and Taxonerator are freely available from http://www.nematodes.org/.
منابع مشابه
A DNA-Based Registry for All Animal Species: The Barcode Index Number (BIN) System
Because many animal species are undescribed, and because the identification of known species is often difficult, interim taxonomic nomenclature has often been used in biodiversity analysis. By assigning individuals to presumptive species, called operational taxonomic units (OTUs), these systems speed investigations into the patterning of biodiversity and enable studies that would otherwise be i...
متن کاملmPUMA: a computational approach to microbiota analysis by de novo assembly of operational taxonomic units based on protein-coding barcode sequences
BACKGROUND Formation of operational taxonomic units (OTU) is a common approach to data aggregation in microbial ecology studies based on amplification and sequencing of individual gene targets. The de novo assembly of OTU sequences has been recently demonstrated as an alternative to widely used clustering methods, providing robust information from experimental data alone, without any reliance o...
متن کاملUse of DNA barcode in the identification of fish species from Ribeira de
Species identification is a difficult task, ranging from the definition of the species concept itself to the definition of the threshold for speciation. DNA Barcode technology uses a fragment of the Cytochrome Oxidase I (COI) gene as a molecular tool that many studies have already validated as a tool for species identification. DNA barcode sequences for COI were generated and analyzed from 805 ...
متن کاملPotential core species and satellite species in the bacterial community within the rabbit caecum
A bacteria library was constructed from the caecum of a rabbit maintained under standard conditions. The complete gene 16S rRNA gene was sequenced. The 228 clones obtained were distributed in 70 operational taxonomic units (OTUs). The large majority of the OTUs were composed of one or two clones and seven OTUs contained half of the sequences. Fourteen sequences had high similarity to the sequen...
متن کاملMicrobial diversity of soils on the banks of the Solimões and Negro rivers, state of Amazonas, Brazil
Analysis of bacterial diversity in soils along the banks of the Solimões and Negro rivers, state of Amazonas, Brazil, was by partial sequencing of the genes codifying the rDNA16S region. Diversity of operational taxonomic units (OTU) and of the divergent sequences obtained were applied in comparative analysis of microbiological diversity in the two environments, based on richness estimators and...
متن کامل